Serveur d'exploration sur l'OCR

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

Graph cut-based binarisation of noisy check images

Identifieur interne : 000314 ( Main/Exploration ); précédent : 000313; suivant : 000315

Graph cut-based binarisation of noisy check images

Auteurs : A. Dawoud [États-Unis] ; A. Netchaev [États-Unis]

Source :

RBID : Pascal:13-0017543

Descripteurs français

English descriptors

Abstract

Binarisation of document images with poor contrast, strong noise, complex patterns and variable modalities in the greyscale histograms is a challenging problem. This study proposes an algorithm for the binarisation of noisy check images to extract handwriting text using normalised graph cuts (GCs). The proposed algorithm uses a normalised GC measure as a thresholding principle to distinguish the handwriting characters from the noisy background. The authors propose a factor to penalise extracting objects that do not have the elongated shape of the characters. Improving the structural quality of the characters' skeleton facilitates better feature extraction and classification, which improves the overall performance of optical character recognition (OCR). Experimental results performed on 560 check images showed significant improvements in OCR recognition rates compared to other well-established segmentation algorithms.


Affiliations:


Links toward previous steps (curation, corpus...)


Le document en format XML

<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en" level="a">Graph cut-based binarisation of noisy check images</title>
<author>
<name sortKey="Dawoud, A" sort="Dawoud, A" uniqKey="Dawoud A" first="A." last="Dawoud">A. Dawoud</name>
<affiliation wicri:level="2">
<inist:fA14 i1="01">
<s1>School of Computing, University of Southern Mississippi</s1>
<s2>Hattiesburg, MS 39406</s2>
<s3>USA</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
</inist:fA14>
<country>États-Unis</country>
<placeName>
<region type="state">État du Mississippi</region>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Netchaev, A" sort="Netchaev, A" uniqKey="Netchaev A" first="A." last="Netchaev">A. Netchaev</name>
<affiliation wicri:level="2">
<inist:fA14 i1="01">
<s1>School of Computing, University of Southern Mississippi</s1>
<s2>Hattiesburg, MS 39406</s2>
<s3>USA</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
</inist:fA14>
<country>États-Unis</country>
<placeName>
<region type="state">État du Mississippi</region>
</placeName>
</affiliation>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">INIST</idno>
<idno type="inist">13-0017543</idno>
<date when="2012">2012</date>
<idno type="stanalyst">PASCAL 13-0017543 INIST</idno>
<idno type="RBID">Pascal:13-0017543</idno>
<idno type="wicri:Area/PascalFrancis/Corpus">000070</idno>
<idno type="wicri:Area/PascalFrancis/Curation">000698</idno>
<idno type="wicri:Area/PascalFrancis/Checkpoint">000076</idno>
<idno type="wicri:doubleKey">1751-9659:2012:Dawoud A:graph:cut:based</idno>
<idno type="wicri:Area/Main/Merge">000317</idno>
<idno type="wicri:Area/Main/Curation">000314</idno>
<idno type="wicri:Area/Main/Exploration">000314</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="en" level="a">Graph cut-based binarisation of noisy check images</title>
<author>
<name sortKey="Dawoud, A" sort="Dawoud, A" uniqKey="Dawoud A" first="A." last="Dawoud">A. Dawoud</name>
<affiliation wicri:level="2">
<inist:fA14 i1="01">
<s1>School of Computing, University of Southern Mississippi</s1>
<s2>Hattiesburg, MS 39406</s2>
<s3>USA</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
</inist:fA14>
<country>États-Unis</country>
<placeName>
<region type="state">État du Mississippi</region>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Netchaev, A" sort="Netchaev, A" uniqKey="Netchaev A" first="A." last="Netchaev">A. Netchaev</name>
<affiliation wicri:level="2">
<inist:fA14 i1="01">
<s1>School of Computing, University of Southern Mississippi</s1>
<s2>Hattiesburg, MS 39406</s2>
<s3>USA</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
</inist:fA14>
<country>États-Unis</country>
<placeName>
<region type="state">État du Mississippi</region>
</placeName>
</affiliation>
</author>
</analytic>
<series>
<title level="j" type="main">IET image processing : (Print)</title>
<title level="j" type="abbreviated">IET image process. : (Print)</title>
<idno type="ISSN">1751-9659</idno>
<imprint>
<date when="2012">2012</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
<seriesStmt>
<title level="j" type="main">IET image processing : (Print)</title>
<title level="j" type="abbreviated">IET image process. : (Print)</title>
<idno type="ISSN">1751-9659</idno>
</seriesStmt>
</fileDesc>
<profileDesc>
<textClass>
<keywords scheme="KwdEn" xml:lang="en">
<term>Algorithm</term>
<term>Background</term>
<term>Complex variable method</term>
<term>Document image processing</term>
<term>Feature extraction</term>
<term>Graph cut</term>
<term>Gray scale</term>
<term>Histogram</term>
<term>Image contrast</term>
<term>Manuscript character</term>
<term>Noisy image</term>
<term>Optical character recognition</term>
<term>Pattern recognition</term>
<term>Performance evaluation</term>
<term>Segmentation</term>
<term>Signal processing</term>
<term>Skeleton</term>
<term>Threshold detection</term>
</keywords>
<keywords scheme="Pascal" xml:lang="fr">
<term>Coupe graphe</term>
<term>Image bruitée</term>
<term>Traitement image document</term>
<term>Contraste image</term>
<term>Méthode variable complexe</term>
<term>Echelle gris</term>
<term>Histogramme</term>
<term>Algorithme</term>
<term>Caractère manuscrit</term>
<term>Détection seuil</term>
<term>Squelette</term>
<term>Extraction caractéristique</term>
<term>Evaluation performance</term>
<term>Reconnaissance optique caractère</term>
<term>Segmentation</term>
<term>Traitement signal</term>
<term>Reconnaissance forme</term>
<term>Arrière plan</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">Binarisation of document images with poor contrast, strong noise, complex patterns and variable modalities in the greyscale histograms is a challenging problem. This study proposes an algorithm for the binarisation of noisy check images to extract handwriting text using normalised graph cuts (GCs). The proposed algorithm uses a normalised GC measure as a thresholding principle to distinguish the handwriting characters from the noisy background. The authors propose a factor to penalise extracting objects that do not have the elongated shape of the characters. Improving the structural quality of the characters' skeleton facilitates better feature extraction and classification, which improves the overall performance of optical character recognition (OCR). Experimental results performed on 560 check images showed significant improvements in OCR recognition rates compared to other well-established segmentation algorithms.</div>
</front>
</TEI>
<affiliations>
<list>
<country>
<li>États-Unis</li>
</country>
<region>
<li>État du Mississippi</li>
</region>
</list>
<tree>
<country name="États-Unis">
<region name="État du Mississippi">
<name sortKey="Dawoud, A" sort="Dawoud, A" uniqKey="Dawoud A" first="A." last="Dawoud">A. Dawoud</name>
</region>
<name sortKey="Netchaev, A" sort="Netchaev, A" uniqKey="Netchaev A" first="A." last="Netchaev">A. Netchaev</name>
</country>
</tree>
</affiliations>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000314 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 000314 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    OcrV1
   |flux=    Main
   |étape=   Exploration
   |type=    RBID
   |clé=     Pascal:13-0017543
   |texte=   Graph cut-based binarisation of noisy check images
}}

Wicri

This area was generated with Dilib version V0.6.32.
Data generation: Sat Nov 11 16:53:45 2017. Site generation: Mon Mar 11 23:15:16 2024